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A NOVEL NUCLEIC ACID MOLECULE 
FJELD OF THE INVENTION 

The present invention is directed generally to an isolated nucleic acid molecule encompassing a 
neocentromere or a functional derivative thereof or a latent, synthetic or hybrid form thereof and 
its use inter alia in developing a range of eukaryotic artificial chromosomes including 
mammalian (e.g. human) and non-mammalian artificial chromosomes. Such artificial 
chromosomes are useful in a range of genetic therapies. 

BACKGROUND OF THE INVENTION 

Bibliographic details of the publications referred to by author in this specification are collected 
at the end of the description. 

The rapidly increasing sophistication of recombinant DNA technology is greatly facilitating 
research and development in the medical and allied health fields. A particularly important area 
is in mammalian including human genetics and the molecular mechanisms behind some genetic 
abnormalities. Progress in research in this area has been hampered by the lack of a cloned 
nucleic acid molecule encompassing a human centromere. The identification and cloning of a 
human centromere will promote the development of techniques for introducing genes into 
eukaryotic cells and in particular mammalian including human cells and will be an important asset 
to gene therapy and the development of a range of genetic diagnostic tests. 

25 The centrormre is an essential structure for sister chromatid cohesion and proper chromosomal 
segregation during mitotic and meiotic cell divisions. The centromere of the budding yeast 
Saccharomyces cerevisiae has been extensively studied and shown to be contained within a 
relatively short DNA segment of 125 bp that is organized into an 8-bp (CDEI) and 26-bp 
(CDEIH) domain, separated by a 78- to 87-bp, highly AT-rich, middle (CDE1I) domain (Clarke 

30 and Carbon, 1985). The centromere of the fission yeast Schizosaccharomyces pombe is 
considerably larger, ranging from 40 to 100 kb, ^nd consists of a central core DNA element of 
4 to 7 kb flanked on both sides by inverted repeat units (Steiner et al, 1993). Recently, the 
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pNAOankedbyaOORbofhigNy repeal sequences on one side (Murphy and Karpen. 1995, 
5 The mammanan centromere, like the centromeres o, an higher eukaryotes studied « . 

human centromere contains 2 to 4 Mb or tne v cnt( , mte 
1989 .99.; Trowel. « .993), p.us a smaUer and more variab,e ,uanut, of a 5-bp sate,h« 
m ONA (Orady .. a,. .992; Trowe,, « .993, The role o f these sate,h,e fences . 
,„ ;lly unclel Wecaon of a cloned n* un.ntem.pted *-sate,,e array into — 
Landd^. 1992) or a 120-kb a-saleffite-containing YAC into human 

wor^veanaly^gedY—^^ 

c_ of the human V chromosome with cloned telomeric DNA (Brown « al. 1994) and 
15 Iggested that .50 ,0 ,00 * of tr-satellite DNA plus -300 * of a*ace„t sequence, are 

ft. reaped 2.5 Mb of a-sateKte array has been produced by telomere-assoctaKd chromos me 
fragmentation (Parr « .995). m aU these studies, it is not known 
DNA sequences are embedded within the e_ric site and operate mdependemly of. or tn 
20 concert with, the a-satellite DNA. 

b mamma*, four constitutive centromere-binding protein, CENP-A, CENP-B. CENP-C. and 
CENP-D have bee* characterized to varying extents and impUc»ed ,0 have po^ble d,rec, roto 
in centromere function. CENP-A. a protein localized to fte outer kinetochore domain, .s a 
25 ce_specifie cor. histone tha, shows sequence homology to the histone H3 pro«in an 
«y serve to differentiate me centromere from the res. of the chromosome a. ^e most 

level of chromaun structure - the nudeosome (SuUivan . -L .994, 
pro ,ein which associates with the centrorrenc teterochromaun through ,ts bmdmg ,0 the CENP- 
B box motif found in primate — e and mouse minor satellite DNA. probab., has a role in 

LTuTprotein is undetectable on the Y chromosome (Pluta cel.. 1990, and „ found on the 
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inactive centromeres of dicentric chromosomes (Earnshaw et al, 1989). CENP-C has been 
shown to be located at the inner kinetochore plate and is postulated to have an essential although 
yet undetermined centromere function, as seen, for example, from inhibition of mitotic 
progression following microinjection of anti-CENP-C antibodies into cells (Bernat et al, 1990; 
5 Tomkiel et al, 1994) and from its association with the active but not the inactive centromeres 
of dicentric chromosomes (Earnshaw et al, 1989; Page et al, 1995; Sullivan and Schwartz, 
1995). Finally, CENP-D (or RCC1) is a guanine exchange factor that appears to have a general 
cellular role that is neither specific nor clear for the centromere (Kingwell and Rattner 1987; 
Bischoff et al t 1990; Dasso, 1993). More recently, a new role for the mammalian centromere 

10 as a "marshalling station" for a host of passenger proteins" (such as INCENPs, MCAK, CENP- 
E, CENP-F, 3F3/2 antigens, and cytoplasmic dynein), has been recognized (reviewed by 
Earnshaw and Mackay, 1994, and Pluta et al., 1995). These passenger proteins, whose 
appearance at the centromere is transient and tightly regulated by the cell cycle, provide vital 
functions that include motor movement of chromosomes, modulation of spindle dynamics, 

15 nuclear organization, intercellular bridge structure and function, sister chromatid cohesion and 
release, and cytokinesis. At present, except for CENP-B, none of the constitutive or passenger 
proteins have been demonstrated to bind mammalian centromere DN A directly. 

In work leading up to the present invention, the inventors identified in a patient (hereinafter 
20 referred to as "BE") an unusual human marker chromosome, mardel 10, which is 100% stable 
in mitotic division both in patient BE and in established fibroblast and transformed lymphoblast 
cultures. In accordance with the present invention, a region of the mardel (10) chromosome has 
been cloned together with the corresponding region from a normal human subject. The nucleic 
acid molecules cloned contain no substantial a-satellite repeats yet are mitotically stable. The 
25 nucleic acid molecules encompass therefore, a new form of centromere referred to herein as a 
"neocentromere". The identification and cloning of a eukaryotic neocentromere without 
substantial a-satellite DNA repeat sequences now provides the means of generating a range of 
eukaryotic artificial chromosomes such as mammalian including human artificial chromosomes 
with uses in genetic therapy, transgenic plant and animal production and recombinant protein 
30 production. A range of diagnostic reagents is now also obtainable using the cloned 
neocentromere . 
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SUMMARY OF THE INVENTION 

Seance Identity Numbers (SEQ ID NO,) for the nucleotide seances referred to in the 
specification are defined following the bibliography. 

5 Throughout this specification and to^^^.^**^^*^ 

integer or group of integers. 

' 0 A fibrobUs, ceU Une 92 01 58 curving *c marde! marter chromosome was deposited a, Che 
1 0*- - C* ~ (BCACC, 0- *>r Appiied R~* 

^^o— (BACs> carrying portions of Ore ma*, (.0, chromosome have aUc been 
15 deposited at ECACC as follows: 

BAC/E8-1: deposited on 5 May 1998 under Accession Number 980505016; 
BAC/F2-14: deposited on 5 May 1998 under Accession Number 980505017. 

20 A number of human fibrosarcoma cell lines carrying various neocentromeric constructs were 
Iposited at ECACC as described hereafter by Accession Number with the date of deposU . 
parenthesises. 

HT _ 38 98050704 (7 May 1998) 

25 98050705 (7 May 1998) 

HX _ 54 98050706 (7 May 1998) 

m _l 90 98050707 (7 May 1998) 

HT . 191 98050708 (7 May 1998). 

30 One aspec. of ft. P-sen, inversion provides a- isofcKd nnc.eic acid mo.ecuie comprising 
sequence of nucieoddes derived from a erotic chromosome and encompassing 
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„ or a l«W derivative synthetic or hybrid form thereof which nucleic acid 



division 

5 



Another aspect of the present invention contempt a nucleic acid molecule or ,ts chemical 
This a terfar, struck which defines a human neocene or a fcncttona, 

:::: zz , « ^ — OI , — « - 

mammalian homologue. 

0 ve, a farther aspect of the present — is directed to an isoiated nucleic acid moiecuie 

extn^hromosomal element which segregates with cell divisron. 

' 5 S ,i„ another aspect of me present invention is directed to an isolated nucleic acid — 
h lg a sequence of nucleotides or Mr chemica! equivalents which directs aeon— 

HLoforamar^ornon-n^homo^ 
does not contain substantial a-sateUite DN A repeat sequences. 

A further aspect of the present — is directed to an isolated nucleic acid molecule 
Imping a nucleotide seance encompassing a neoc_e or a fcnctronal de_ 
25 syntL o hybrid formtW which when said nucleic acidmolecule is in Imear form an c. 

into a ceU together with a telomeric sequence, is capable of rephcaong. — g as 
» „„*««.n»l aement and segregates with cell divis.on. 

Another aspect of the p^sen, invention provides an isolated nudeic acid module or a 
30 deriv.dve.s,nu™ucorhybridformthe,eofcom P risingase,»e„ceofnucleot,des ; 

7 whichdi™form»Uondefi m ngahum^^^ 



- 6- 



, i • j fnrrn rh-reof or its mammalian or non-mammalian 
thereof or a latent, synthetic or hybnd form thereof or .ts m 

nomologue wherein said neocentromere is capable of associates w* CENP-A and 
CENP-C; 

5 oi ^^capabte.^nu.Uoducedin.oco^b.eceUs.ofrepUcauon.re^nmgex.ra. 
chromosomal and segregating with cell division. 

Even ye, another aspect of 1* present invention is directed W a genetic constat comprismgan 
origin of relation for a eukaryottc ceU and a nucleic acid module encompassuig a human 
,0 nlntroml or a — derivative thereof or a iaten, synthetic or hybridform thereof or 
or no,mammahan homoiogue fianKed by Homeric nucleoude : „ 
functional in the cell in which me genetic consume, is to repiica* an* wheretn sard g«^ 
construe, when introduced into a ceU is a repiicing. e«ra-c_»al element w,uch 
segregates with cell division. 

15 Another asp*, of the present invention is direct*. ,0 a genedc construct in the form of a 
e^ric artificial chrotnosome such as a mammalian artificial chromosome (MAC), a human 
artificial chromosome (HAC) or comprising an origin of replication and a sequence of 

nucleotides which: 

20 (i) ^aconforn^ndefiningahununnec^tromereoramncuonalder^ 

or a latent, syndic or hybrid form thereof wherein said neoccntromere is capable of 
associating with CENP-A and CENP-C or antibodies .hereto; and 
(„) contains no substantial a-sauuliteDNA repeat sequences; 

sequence of nucleotides flanked by eukaryotic (e.g. mammalian, telomeric sequences whtch 
25 axe intum uanW b, yeas, .Corner* sequels wherein a unique eniyme sue is located between 
the human and yeas. Homeric nudeoude sequences such that upon contact with sard enzyme, 
thc yeas, Homeric sequences are removed «rt ,he eulcaryouc (e.g. maramahan) telomenc 
sequences are exposed. 

30 Sdl. anomer aspect of the present invendon provides a genetic construe, comprising an origin 
ofrepBcauon and a fns, nucleic acid molecule defining a human »eccen<romere or a functional 
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derivative thereof or M syntheuc or hybrid form .hereof, a seed nucieic acid tnoiecuie 
1 flankedTy a *. set of .uka^otic (e.g. a—, such as human, — « sequences 

5 H are u»que enzyme sites between the Hrs, and second Homeric sequences such that upon 
5 rr^arJrede^e.U.esecond.e.orneric sequences are cieaved off to expose the 

first telomeric sequences. 

BRIEF DESCRIPTION OF THE FIGURES 

10 Figur e a is a schematic represent shoeing identification of a VAC contig spannin* 
J te ce„tromere region. (A) Comparison of GTL banding patterns of mardel iOandnorma. 

^crating the Iter chromosome (Vounaire er „, .9,3). The iong and shot, - o 

p arms of Ore norma! ch_e ,a Asterisk denotes the position of 

was use* to "ug» the q'-arm of sketched marker chromosomes ut the ANTI-CEN/FISH 

20 1 shown. (C, FISH mapping of seiected VAC ciones from conug *»2 us-ng norm* 
Lescence microscopy and standard metaphase chromosomes prepay, f.m — 
phobias, ecus o, patient BE. The distribution o, FISH signais (veruca. axrs, - *ow * 

Tn the oppose arm of me chromosome. The toui numoer of florescence stgnais scored for 
25 each of the YAC clones is indicated in brackets. 

Figure 2 is a photographic representation showing ANTI-CEN/FISH an*** o, the marker 
celtromere. (A) Detection of a-sateilite ON A using a mixture of 

signais) under iow stringency conditions. Centromeres were counter-iabeUed «k CRESTS 
30 LbL- antibody (paie biue dots ; or white when superimposed on a r^ background, 
c™™ were prepared from transformed iymphobiast ceUs o, pauen. BE. The nght-hand 
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p-d represents green pseudo-coloring of DAPI images of chromes to prov.de aba** 
Lition of chromosome oudine. Only the signal for the -body. bu. no, *« for 
was seen on Ore marker centromere (arrowed, . (B) Simultaneous « o, stretcl^ human 
metaphase chromosomes » CKEST* (red) and and-CENP-C andbody. Am-C, ** b.ue, 

5 11 white color indicating *dl coincidence of the t wo antibody signals. (C, Detection of 
CENP-C on the marker chromosome. Simultaneous labelling of me marker chromosome 
(arrowhead) witi, (a) Am-Cl (pale blue) artd (b) CRESW6 (red), (c) Combined images of a and 
b showing complete coincidence of Am-Cl and CRESTA6 signals, (d) FISH analysis of the same 
cell as a-c using the >0pC38 cosmid probe (pate blue dots and green arrows, to idem,* the 

,0 markerchromosome. Some loss of ANTI-CEN signa!. especially for the Am-Cl antibody was 
seen following FISH. (.) Green pseudo^oloring of DAPI image, A colour photograph 
corresponding to this figure is available upon request. 

Figure 3 is a photographic representation showing ANI1-CEN/FISH analysis of cosmid clones 
,5 on stretched (A. a-0 and supe.stie.ched (B) n*.aphase chromosomes, (a-c, Examples of cosrmd 
signals (white arrows) .ocalized «o the a'-region of d» marker centromere, (d-f) Examples of 
cosnud signals (white arrows) locaEzed to the p-region of me marker cen<ron*re. Green arrows 
indicate positions of the 10»C3t cosmid DNA tag used to mark the ,-end of the marker 
chromosome. (B, Mapping of Y6C21 onto a supers.re.ched metaphase chromosome. No. 
20 inc.uded is me 10pC3S q ,.ag signal loca,ed further to me left of me 

show. ANTI-CEN signals are in red, FISH signals are in pale blue, and overlapptng ANTI-CEN 
and FISH signals are in white. Each of the pictures is accompanied by DAPI images of 
chromosomes pseudo-coloured in green. A colour photograph corresponding .0 mis figure ,s 
available upon request. 

25 figure 4 Localization of the anti-centromere antibody-binding domain, a, Relative positions of 
different cosmid and PAC clones within the YAC #082 contig, using YAC-3 as a reference. 
Cosmids are designated as YnCm, where V denotes the YAC of origin and denotes the 
cosmid number. PACs 1-5 are five different PAC clones isolated from a human PAC library 

30 (Genome Systems toe). "HC-contig" represents a group of overlapping cosmids that map ughtly 
around the marker centromere in ANTI-CEN/FISH experiments. A genomic map corresponding 
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to the depicted YAC region was derived from the DNA of patient BE and shown above the YAC 
map S Sail; K, Kspl; N. Notl; Sf. Sfil. b. Cumulative scoring of FISH signals in ANTI- 
CEN/FISH experiments for cosmids Y3C64. Y6C8. Y3C94. Y7C14. Y4C45, Y6C10, Y6C21. 
Y3C3 PAC5, Y13C1. Y13C8, and Y17C6. The distribution of FISH signals (vertical axis) is 

5 those found on the opposite arm of the chromosome. The total number of fluorescence signals 
scored for each of the cosmid clones is indicated in brackets, c, Restriction mapping of the 80-kb 
region covered by the eight overlapping cosmids of the HC-conug. These eight cosmids were 
derived from four different YACs (YAC-3. YAC-4. YAC-6. and YAC-7) and provided 
independent confirmation of the map. Furthermore, the map agreed fully with the restriction map 

10 of a 120 kb-insert PAC clone (PAC4) that spanned the entire HC-contig region. E, EcoWi R. 
EcoRV; N, Notl. 

Figure 5 is a representation showing restriction analysis of genomic DNA of patient BE and 
those of his normal parents using Y6C10 as probe. DNA was resolved on a PFGE (A) or 
15 standard agarose gel (B and C). Samples 1, 2 and 3 were fibroblast cultures of mother of BE, 
father of BE. and patient BE. respectively. Sample 4 was a somatic hybrid cell line BE2C1-18-5F 
containing the marker chromosome. Fragment sizes are in kilobases. 

Figure 6 is a representation of the full nucleotide sequence of the HC-contig DNA derived from 
20 normal human chromosome lOq 25.2 region. 

Figure 7 is a diagrammatic representation of the method used to retrofit YAC3 and YAC5. 

Figures 8A to J are diagrammatic representations of the different vectors used for cloning 
25 DNA as YACs by the conventional restriction/ligation methods. 

Figures 9 is a diagrammatic representation of circular TAR summarising the recombination 
process. 

30 Figure 10 is a diagrammatic representation showing modification of TAR vector. 
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Figure 11 is a diagrammatic representation of the cloning of 10q25 human neocentromere 
DNA from mardel (10) chromosome. This DNA is designated NC-contig DNA to distinguish 
it from the HC^ontig derived from the corresponding region of the normal chromosome 10. 
(A) Structural map of the NC-contig region and flanking DNA. Arrows indicate the relative 

5 positions and directions of primers used inPCR analyses (Table 3). The restriction sites EcoRl, 
EcoRV, Srfl, and Sfil and Sftl are indicated by E, R. Sr and Sf. respectively. The position of 
the TAR "hook" CE-F2 is represented by the solid box. The hatched bar represents HC- or NC- 
contig. p' and q' refer to the short and long arms of mardel (10), respectively. (B) Circular 
TAR strategy using the vectors P VC39-Alu/C3-F2( + ) and P VC39-Alu/C3-F2(-) for the direct 

10 cloning of the neocentromere DNA from mardel (10). The position of the Alu consensus 
sequence hook is represented by the white box. Crosses denote the sites of recombination 
between the TAR vector and the genomic DNA at the Alu and C3-F2 hooks during cloning. 
(C) Structural maps of the resulting circular YACs 5f-52-E8 and 5f-38-F2 containing the 
neocentromere DNA of the mardel (10) chromosome. The DNA flanking the NC-contig is 

15 represented by stippled bars. (D) Structural maps of BAC/E8-1 and BAC/F2- 14. Nt represents 
Notl and URA-BAC-neo represents the retrofitting vector BRV1 (Larionov et aL. 1997). 

Figure 12 is a diagrammatic representation showing specific TAR of HC-region from mardel 
10. 

20 

The method was as follows: (1) Co-transformation into YPH857; (2) Select HIS* colonies; (3) 
screen for HC-region by PCR; (4) Prepare high-MW DNA; (5) Digest with I-Scel to expose 
hTELS; (6) Transfect HT 1080 cells; (7) Select for 0418*; and (8) analyse by PFGE and FISH. 

25 Figure 13 is a diagrammatic representation showing cloning in yeast as YAC/HAC. 

Figure 14 is a diagrammatic representation outlining TACT procedure. 

Figure 15 is a diagrammatic representation of TACT constructs. 

30 

Figure 16A is a representation of the full nucleotide sequence of the NC-contig DNA derived 
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ta. ^ ( W - coTespo^ds .0 ft. HC<o„.i 8 DNA region of „* norma! cr_e .0. 

^ » is . representor, of ,he pan* nuclide «— " - BACT2 " 14 *- *" 
U derived from a region irranediaKly p' of the NC-contig DNA (see Fig. UD>. 

5 
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SUMMARY OF SEQIDNOs. 



10 



15 



20 



25 



SEQ ID NO. 
1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 



DESCRIPTION^ 

DN A primer 
DNA primer 

Nucleotide sequence of HC-contig 

Nucleotide sequence of NC-contig 

BAC-F2 contig 1 

BAC-F2 contig 2 

BAC-F2 contig 3 

BAC-F2 comig 4 

BAC-F2 contig 5 

BAC-F2 contig 6 

BAC-F2 contig 7 

BAC-F2 contig 8 

BAC-F2 contig 9 

BAC-F2 contig 15 

BAC-F2 contig 33 

BAC-F2 contig 39 

BAC-F2 contig 41 

BAC-F2 contig 42 

BAC-F2 contig 44 

BAC-F2 contig 47 

BAC-F2 contig 47 fragment 1 

BAC-F2 contig 47 fragment 2 

BAC-F2 contig 47 fragment 3 

BAC-F2 contig 47 fragment 4 

BAC-F2 contig 47 fragment 5 

BAC-F2 contig 47 fragment 6 
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27 
28 
29 



BAC-F2 contig 47 fragment 7 
BAC-F2 contig 47 fragment 8 
BAC-F2 contig 47 fragment 9 



ABBREVIATIONS USED IN THE SUBJECT SPECIFICATION 



mardcl (10): 

HAC: 
YAC: 
10 MAC: 
PLAC: 

neocentromere: 

CENP: 
HC-contig: 

15 E8: 
F2: 
BE: 
TAR: 
PCR: 

20 Marker neocentromere: 
NC-contig 



Marker chromosome from patient BE: comprises a 
rearrangement of chromosome 10. 

Human artificial chromosome 

Yeast artificial chromosome 

Bacterial artificial chromosome 

Plant artificial chromosome 

A centromere containing no substantial a-sateUite DNA 
Centromere binding protein 

Region of normal chromosome 10 comprising 
neocentromere 

q' end/region of mardel (10) neocentromere 

p' end/region of mardel (10) neocentromere 

Patient from which mardel ( 10) identified 

Transformation-associated recombinant 

Polymerase chain reaction 

neocentromere on mardel (10). 

region of mardel (10) chromosome comprising 
neocentromere 



DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 



25 The present invention is predicated in part on the identification and isolation of nucleic acid 
m olecules exhibiting neocentromenc properties. In accordance with the present invention, a 
neocentromere is considered a centromere which does not contain substantial a-satellite DNA 
repeat sequences and, when activated, is capable of functioning as a centromere. The term 
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in this context M that ft. nucleic acid molecute does no, contain detectable 
0-satelli.e by F!SH analysis under medium stringency conditions. The neocenKomere may 
contain a small nun.bc, of nigh* diversed .-satellite DNA. In ptima.es. «— ">NA ts 
consider Hlbph in length. An nucleic acid moieou.. containing an activated neocentromere 
5 , r a neocentromere otherwise functioning as a centromere facuita.es i„ accordance wtth ft. 

invention, the nucleic acid moiecnle reheating, remaining extra-chromosotmd and 
segregating with cel. division. Reference berein to " * — » — 

centromere substantially devoid of -satellite DNA repeat sequences. 

,0 Accordingly, one aspect of Ore present invention provides an isolated nucleic acid molecule 
comprising a sequence of nucleotides which defines an eutaryotic neocentromere. 

More particularly the present invention provides an isolated nucleic acid molecule comprising 
, sequence of nucleoddes derived from a eukaryotic chromosome and encompassmg a 
,5 neocentromere which nucleic acid molecule when introduced into a compatible cell is capable 
of replicant, acung as an extra^bPomosomal elem». and segregaung with cell dtvston. 

The present inveruion is exemplified herein by the idennficauon and cloning of a human 
^centromere. This is done, however, wim the undenttanding that the present invention extends 
20 to alenxaryotic ncocenaomeres such as fromtmmma«an. pta, aviaor. msec*, fungal. yeast and 
„ptilian cltr„m<*omes. The most preferred oeoeentromere, however, is from human 
chromosomes and their mammalian homologues. 

The present invention is predicated in part on the identification of an unusual <*™°^ 
25 markerinapatientdesignated-BE". The chromosomal marker is referred to as "mardel (10) 
and results from a rearrangement of human chromosome 10. The mardel (10) marker „ 
mitotically stable and. in accordance with the present invention, contains a funcUonal 
neocentromere at a location regarded as non-centromeric. The neocentromere at mardel (10) 
is located between q24 and q26 on chromosome 10 and more particularly around q25. Even 
30 morcparucularly,^^ The present invention 

is exemplified by DNA cloned from the q24-q26 region of the mardel (10) chromosome as well 
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as the corresponding region on normal human chromosome 10. These DNA molecules contain 
a functional neocentromere. The present invention extends, however, to any neocentromere or 
any chromosome in mammalian and non-mammalian animals as well as plants, yeasts and 

fungi. 

5 For convenience, the DNA clones from the mardel (10) chromosome as well as from normal 
human chromosome 10 are summarised in Figure 1 1 . The neocentromere located at or around 
10q25 is located on a clone designated the "HC-contig". DNA clones from mardel (10) are 
referred to as "E8" or the "NC-contig* which extends from the long arm (q') of mardel (10) 

10 towards the short arm (p'). Clone F2 extends further p' from E8 (see Figure 11). It is 
emphasised, however, that the present invention extends to any neocentromere on any human 
chromosome as well as neocentromeres on other mammalian and non-mammal.an 
chromosomes including chromosomes from plants, insects, reptiles, yeast and fungi. 

15 The present invention further contemplates a nucleic acid molecule or Us chemical equivalent 
having a tertiary structure which defines a human neocentromere or a functional derivauve 
thereof or a latent, synthetic or hybrid form thereof or its mammalian or non-mammalian 

homologue. 

20 Even more particularly, the present invention is directed to an isolated nucleic acid molecule 
having a sequence of nucleotides or their chemical equivalents which directs a conformation 
denning a human neocentromere or a functional derivative thereof or a latent, synthetic or hybrid 
form thereof or its mammalian or non-mammalian homologue wherein the centromere associates 
with centromere binding proteins (CENP) -A and CENP-C or antibodies thereto. 

25 

Reference herein to "latent" in relation to a centromere includes reference to a centromere not 
normally functional but nevertheless activatable under certain conditions. A latent centromere 
may also be considered as a neocentiomere provided it has no substantial ^-satellite DNA repeat 
sequences. 

30 

The size of the neocentromere in accordance with the present invention may range from about 
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50b, to about ISOORbp.fro^^^ 

to about 300bp. ^o-p-d^— od^d-c— ™ 

5 In another embodiment, the centromere is about 80 kbp. 

wr rnntie for human chromosome 10 of the 
The nucleic acid molecule encompassing the HC-conag tor numan 

Lrra^ the NC-con,ig (part of m from mardel (10, is se, forth ,n Figu* « ^ 

,0 NO- 4, The nucleic acid a*** encompassing F2 ot mardel (10) is set fc* - «*» ' 6B 
a s«pa iaK co„U IS (SEQn>NO S;5 -29,.The„ U c te c» Mm ol«c„.«s h avea K ^s tt ,« U « 

^^e n ce^ tt «^o^nw hi ch tat umd«r, neS U»^^n««.Al 1 h« 0 8h.h, S 

a'centionsre and » * se, forth in one « more of Figu.es 6. 16A and/or 

,6B node, low .Agency —I- « <**C «"* * T"T 

having a. to. abou. 40% nucleotide similarity » one o, more sequence, se, fonh in Ftgures 6. 
20 16A »d,or 16B. MM* «- F— •> *— » is at lea,, about 50%. mo. preferably at 
,eas, about 60*. still more ,M « ^ about 70*. even more preferably a. leas, about 
80-90% ot above such as 95%. 97*. 98% and 99%. 

25 the HC contig and .Uniting sequence as well as nucleotide sequences — to YAC 3 and** 
YAC 5 at the homology, similarity or hybridization levels. 

Refc^Ke he*™ to a b. suingency a. 42°C includes and encompasses from « tost about 1% 
v/v to a, leas, about 15% v/v formamide and from a, to* about 1M to at leas, about 2M salt for 
30 hybridisation, and a. leas. abet.. ^ "^"^ 
stringy condition may be applied where necessary, such as medium suingency. winch 
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c . a hn.,t 1 6% v/v to at least about 30% v/v fonnamide and 
includes and encompasses from at least about 16% v/v tWMlt0 5M 
from at le ast about 0.5M » at least about 0.9M salt for hybridisaUo, and at least about 0.5M 
to at ieast about 0.9M sait for washing conditions, or high Stringency, which includes and 
enconpassesfromatle.staboutS^.v/v to at least about 50% v/v fonnarnide and from a t leas 
5 ZZoM to arfcast about 0,5M sait for hybridisation, and at least about 0.0 !M to a, .easr 
ahout 0. ISM sait for wasfcng conditions. TT*se stringency conditions may be altered dependent 
on the source of DNA and other factors. 

The term "similarity- as used herein includes «ac< identity between compared sequences at U>e 
,0 nuclide .eve!. Where .here is non-identity a, the nucleotide ievel. "sinnlarity 

differences between sequences which nevertheless result in conformation deftntng a funcuonal 

neocentromere. 

The nudeic acid molecule of.be prese* invention may comprise a naturally occurring nucleotide 
,5 sequence from a healthy human subject or may comprise the nucleotide sequence torn a human 
subject editing one or more chromosomal-dependent conditions such as a subject carryu,g 
^ 10 cruomosome or a chromosome conferring an equivalent or simUar condition or may 
carry one or more nudeotide substitutions, deletions and/or additions relative ,o the naturally or 
non-natunllyoccurringsequence. Soch n-diScauora are referred therein as "derivatives and 
20 include »«, fragments, pari, homologues and analogues of the namrally occumng 
rt* *quer*«. Preferably, the derivatives of the present invention soil define a functional 
neocentromere. 

Reference herein to a "neocentromere- includes reference to a functional neocentromere or a 
25 functional derivative thereof meaning tat i, is capable of facilitating sister chromatid coheston 
and chrorrosorral segregation .luring mitotic cell divisions attd/or U capable of associating w.lh 
CENP-A andfor CENP-C and/or is capable of interacting with anu-CENP-A antibodies or a„u- 
CENP-C antibodies. Generally, and preferably, the neocentromere is incapable of tnteracting 
with CENP-B or anti-CEP-B antibodies. Alternatively, me neocentromere may be a latent 
30 centrome* capable of activation by epigenetic mechanisms. The neooentromere may also be a 
hybrid of other human. matnmaJiarr, plant or yeas, neocentromeres. Synthetic neccentromeres 
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provided by, for example, polymeric techniques to arrive at the correct confrontation are also 
contemplated by the present invention. All such forms and definition, of neocentromere are 

encompassed by use of this term. 

5 Another aspect of the present invention provides an isolated nucleic acid molecule or chemical 
equivalent having the following characteristics: 

(i) comprises a nucleotide sequence or chemical equivalent directing a conformation which 
defines a neocentromere or a functional derivative thereof or a latent, synthetic or hybrid 
form thereof or; 

10 (ii) comprises a nucleotide sequence or chemical equivalent substantially as set forth in one 
or more of Figures 6, 16A and/ or 16B or having at least about 40% similarity thereto 
or capable of hybridising thereto under low stringency conditions at 42°C; and 
(iii) comprises a neocentromere capable of associating with CENP-A or CENP-C or 
antibodies thereto. 

15 

Preferably, the neocentromere is incapable of interacting with CENP-B or antibodies thereto. 

In a particularly preferred embodiment, the centromere corresponds to a human genomic region 
which maps between q24 and q26 on chromosome 10, and in particular q25 on chromosome 10. 

20 

The nucleic acid molecule or its chemical equivalent of the present invention defining a 
conformational neocentromere or functional derivative thereof or latent, synthetic or hybrid form 
thereof is useful inter alia for the generation of artificial chromosomes such as human artificial 
chromosomes (HACs), mammalian artificial chromosomes (MACs), yeast artificial chromosomes 

25 (YACs)and plant artificial chromosomes (PLACs). HACs are particularly useful since they are 
capable of accommodating large amounts of DNA and are capable of propagation in human cells. 
The HACs are non-viral in origin and. hence, are more suitable for gene therapy by, for example, 
introducing therapeutic genes. Furthermore, the HACs remain extra-chromosomal and. hence, 
have no insertional/substitutional mutagenic potential. The essence of a HAC is the presence of 

30 a neocentromere or latent, synthetic or hybrid form thereof which enables stable segregation 
during cell division. The HAC also remains extra-chromosomal and, hence, is more suitable for 
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gene therapy. Referee to W*—— T — *« " *» »°< ~ int ° ,be 
chromosome and, in effect, is episomal. 

Accordingly. the present invention provides a generic consume, conning an origin of 
5 replication for a eulcaryotic ceil and a nucleic acid molecule encompassing a eukaryouc 
^nrronwe or a functional derivative thereof or a latent, synthetic, hybrid form thereof or its 
^rnmalan or non-mammalia, homologue flanked by telomeric nucleotide sequences functional 
in .he cel. in which the genetic construct is to replicate and wherein said genetic construct when 
introduced into a cell is a replicating, extra-chromosomal element which segregates wrth cell 
10 division. 

More particularly, the present invention further contemplates a genetic construct in the form.of 
an artificial chromosome comprising an origin of replication for a mammalian, human, plant or 
yeast cell and a nucleic acid molecule encompassing a human neocentromere or a functional 
15 derivative thereof or a latent, synthetic or hybrid form thereof or its rr^mrnai,- or 
non-mammalian homologue flanked by telomeric nucleotide sequences functional in the cell u. 
which the artificial chromosome is to replicate. 

Another embodiment provides a genetic construct in the form of an artificial chromosome 
20 comprising an origin of replication for a marnrnalian, human, plant or yeast cell and a nuclexc acid 
molecule having a tertiary structure which defines a human neocentromere or a functional 
derivative thereof or a latent, synthetic or hybrid form thereof or its mammalian homologue 
flanked by telomeric sequences functional In the cell in which the artificial chromosome » to 
replicate. 

25 Yet another embodiment is directed to a genetic construct in the form of an artificial 
chromosome comprising an origin of replication for a mammalian, human, plant or yeast cell and 
a nucleic acid molecule having a sequence of nucleotides which directs a conformation defining 
a human neocentromere wherein the centromere associates with CENP-A and/or CENP-C or 

30 antibodies thereto and does not contain substantial a-sateUite DNA repeat sequences, said 
nucleic acid molecule flanked by telomeric nucleotide sequences functional in the cell which the 
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artificial chromosome is to replicate. 

Sti,. yet mother aspect cf the present invendon relates to a genetic construct in the form of an 

^fficiaUiuorno^cornpns^ 
5 cell and a nucleic acid molecule comprising a sequence of nucleotides which: 

(i) directs a conformadon which defines a neocentromere or a functional form thereof or a 
latent, synthetic or hybrid form thereof; 

(ii) comprises a nucleotide sequence substantially as set forth in one or more of Figures 6, 
16A and/or 16B or having at least about 40% similarity to the nucleotide sequences set 

10 forth in Figures 6, 16A and/or 16B or is capable of hybridising to one or more of these 

sequences under low stringency conditions at 42°C; 
wherein the neocentromere is capable of associating with CENP-A and/or CENP-C or antibodies 
thereto and wherein said nucleic acid molecule is flanked by telomeric nucleotide sequences 
functional in the cell in which the artificial chromosome replicates. 

^ In a preferred embodiment, the genetic construct is a HAC and comprises human telomeric 
sequences. In a particularly preferred embodiment, the HAC further comprises yeast arttfc.al 
chromosome (YAC) arms and, hence, becomes a HAC/YAC shunie vector capable of 
propagation in human and yeast cells. Preferably, the HAC/YAC contains a unique enzyme site 

20 between yeast telomeric sequences and human telomeric sequences such that upon contact wUh 
the particular enzyme, the yeast telomeric sequences are removed leaving the human telomeric 
sequences. Preferably, the unique enzyme site is a yeast specific enzyme site such as I-Scel. 

According to this embodiment, there is provided a genetic construct defining a HAC/YAC 
25 comprising an origin of replication and a nucleic acid molecule encompassing a human 
neocentromere or a functional derivative thereof or a latent, synthetic or hybrid form thereof or 
a mammalian or non-rnammalian homologue thereof, said nucleic acid molecule flanked by 
human telomeric sequences which are in turn flanked by yeast telomeric sequences wherein a 
unique enzyme site is located between the human and yeast telomeric nucleotide sequences such 
30 that upon contact with the enzyme, the yeast telomeric sequences are removed and the human 
telomeric sequences are exposed. 
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More particularly, the present invention is directed to a genetic construct defining a HAC/YAC 
comprising an origin of replication and a nucleic acid molecule encompassing a human 
centromere or a functional derivative thereof or a latent, synthetic or hybrid form thereof or a 
roarnmalian or non-mammalian homologue thereof wherein the neocentromere associates with 
5 CENP-A and/or -C or antibodies thereto and does not contain substantial a-satellite DNA 
sequences wherein said nucleic acid molecule is flanked by human telomeric sequences which are 
in turn flanked by yeast telomeric sequences wherein a unique enzyme site is located between 
the human and yeast telomeric nucleotide sequences such that upon contact with said enzyme, 
the yeast telomeric sequences are removed and the human telomeric sequences are exposed. 



10 
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Even more particularly, the present invention is directed to a genetic construct in the form of a 
HAC/YAC comprising an origin of replication and a sequence of nucleotides which directs a 
conformation defining a human neocentromere or a functional derivative thereof or a latent, 
synthetic or hybrid form thereof or a mammalian or non-mammalian homologue thereof wherein 
15 said neocentromere is capable of associating with CENP-A and/or CENP-C or antibodies 
thereto, said sequence of nucleotides flanked by human telomeric sequences which are in turn 
flanked by yeast telomeric sequences wherein a unique enzyme site is located between the 
human and yeast telomeric nucleotide sequences such that upon contact with said enzyme, the 
yeast telomeric sequences are removed and the human telomeric sequences are exposed. 

Preferably, the length of the nucleotide sequence is between about 30 kpb and 1 500 kpb, and 
more preferably between 60 kbp and 1000 kpb. 

In a particularly preferred embodiment, the unique enzyme site is a yeast specific enzyme site 
25 such as I-Scel. 

The present invention extends to yeast cells and human cells carrying the genetic constructs of 
the present invention and to proteins produced therefrom. 

30 The genetic constructs may also comprise marker genes and other unique restriction sites to 
facilitate insertion of adventitious DNA. Accordingly, the genetic constructs of the present 
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invention may further comprise adventitious or heterologous DNA encoding . product of 
interest. Prefer product of interest Wude pharmaceutical* useful genes such as genes 
encoding cytokines, receptors, grow* regulators and the like. Endogenous genes may aiso be 
replaced by wild-type genes or modified genes. 

The adventitious or heterologous DNA may also encode a molecule not synthesised in a 
sufficient amount in a particular subject and hence the increased copy number permits greater 
amounts of the molecule being synthesised. 

10 Accordingly, the present invention contemplates a genetic construct comprising an origin of 
replication and a first nucleic acid molecule defining a human neocentromere or a functional 
derivative thereof or latent, synthetic or hybrid form thereof or a mammalian or non-mammalian 
homologue. a second nucleic acid molecule encoding a peptide, polypeptide or protein, wherem 
said first and second nucleic acid molecules are flanked by a first set of human telomenc 

15 sequences which are in turn flanked by a second set of yeast telomenc sequences wherein there 
are unique enzyme sites between the human and yeast telomenc sequences such that upon 
contact with said enzyme, the yeast telomeric sequences are cleaved off to expose the human 
telomeric sequences, 

20 Reference herein to segregate preferably rneans mitoticaUy stable segregation. Conveniently, 
stable segregation may be determined as the presence of an artificial chromosome in 40-60% of 
daughter cells after 4-6 months of continuous passage. 

The present invention extends to other artificial chromosome analogues to the HACs and 
25 HAC/YACs described above such as MACs and PLACs. 

Another aspect of the present invention relates to peptides, polypeptides and proteins which bind, 
interact or otherwise associate with the human neocentromere of the present invention or its 
rnarnmalian and non-marnmalian homologue. Preferably, the molecules are proteins, referred to 
30 as primary proteins. The 1- proteins bind to the neocentromere and secondary (2°) proteins 
bind to the I" proteins before or after association with the neocentromere. The identification 
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of the human neocentromere in accordance wiih the present invention provides a mechanism for 
assaying 1* proteins and 2° proteins which may be important for screening chromosomes in. for 
example, genetic disorders. This is particularly the use in Down's Syndrome which results from 

defective chromosome segregation. 

5 The 1« proteins are readily detected by, for example, a gel shift assay. The nucleic acid molecule 
of the present invention defining the human neocentromere is digested, labelled and contacted 
with nuclear extract putatively containing the 1 • proteins and resolved on a gel. When a V 
protein binds to a fragment carrying a binding portion of the neocentromere, the DNA fragment 

10 migrates in the gel at a slower rate due to the bound protein. 

The present invention extends to purified 1 ° proteins capable of association with the subject 
centromere and to genetic sequences encoding same and to antibodies thereto. 

15 The neocentromeres of the present invention are readily identified and characterised using, for 
example, human fibrosarcoma cell lines. For example. DNA suspect of carrying a 
neocentromere. is introduced into fibrosarcoma cells in a linear form, generally together with a 
telomeric sequence. The cells are then screened for the presence of replicating, extra- 
chromosomal and segregating elements, referred to as mini chromosomes. 
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The present invention further encompasses eukaryotic cells carrying replicating, 
extrachromosomal and segregation nucleic acid molecules. Preferably the eukaryotic cells are 
ttarronalian cells and most preferably human cells. The nucleic acid molecules according to this 
aspect of the present invention are preferably as herein described. Particularly preferred cells are 
25 HT-38, HT-47, HT-54, HT-190, HT-191, BAC/E8-1, and BAC/F2-14. 



The present invention is 



further described by the following non-limiting Figures and Examples. 



r-\orcR\EJHtf > OM'>i-w , CT- iwi 



24- 



EXAMPLE 1 
YAC and Cosmid Probes for FISH 

YACs carrying specific STSs were identified (Moir et al, 1994) by PCR-based screening of 
5 YAC libraries prepared in P YAC4 vector at the Center for Genetics in Medicine at Washington 
University (Brownstein et al, 1989) and at the CEPH (Albertsen et al. 1990). Cosmid DNA 
inserts (3M0 kb) were ligated to SuperCos I vector (Stratagene) and packaged with Chgapack 
III Gold extract (Stratagene) according to the manufacturer's instructions. YAC probes were 
prepared by Alu-PCR of total yeast genomic DNA using primers 5'- 
10 GGATTACAGG(C/T)(A/G)TGAGCCA-3' [SEQ ID NO:lJ and 5'- 
(A/G)CCA(Cyr)TGCACTGCAGCCTG-3' [SEQ ID NO:2] according to published method 
(Archidiacono et al. 1994). For probe labelling, 1 ug of the YAC PCR products or whole 
cosmid DNA isolated by CsCl centrifugation or Qiagen column was used. The DNA was 
labelled with Biotin-16-dUTP (Boehringer Mannheim) using a NICK translation kit (Boehmger 
15 Mannheim). A probe rrux of 6-10 ug/nd of bio^ DNA 
(Boehringer Mannheim). 500 ug/ml of carrier salmon sperm DNA and, where indicated, 10 
ug/ml of biotinylated 10pC38 tag DNA was ethanol precipitated, resuspended in a hybridization 
mix of 50% v/v formamide in 2 x SSC and 10% w/v dextran sulphate, denatured at 95°C for 
5 min. preannealed for 30-60 min at 37'C to suppress repetitive sequences, before adding to 
20 slides. FISH of a-satellite and satellite III probes was performed under low stringency as 
previously described (VouUaire et al, 1993). 

EXAMPLE 2 
Somatic Cell Hybrids and Other Cell Lines 

25 

Skin Fibroblasts and transformed lymphoblast cell lines were established from patient BE 
(Voullaire et al, 1993) and from his normal parents. The presence of the mardel 10 chromosome 
in the patient cell lines was confirmed by FISH. In addition to these cell lines, two somatic cell 
hybrids were produced by fusing cultured fibroblast cells derived from patient BE with the 
30 Chinese hamster ovary cell line CHO-K1 using polyethylene glycol. Hybrid cells were selected 
in a proline-free medium for the glutamic oxaloacetic transaminase- 1 (GOT-1) gene located in 
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lW regioa One rf ft, hybridceU Unes. designated BHCMMt was shown to con,a>„ 
,he normai chromosome .0 but no, the marker chromosome, whUe another hybrid ceU Une. 
designated BE2C1-1 8-SF. contained the marker chromosome bo. no. the normal chromosome 
10 of pane* BE. The presence or absence of these chrotnosomes was established by karyotyptng 
5 and ANTI-CEN/FISH probing. In addiUon. PCR anaiysis of an STS (sequence tagged s«e> 
marker, AFM2 5 9>g5, which resided on YAM. confirmed ft. status of rhese chromosomes m 
ft. hybrids and excluded fte presence of submicroscopic fragments of me marker centromere 
region within the genome of BE2Cl-18-lf, or the presence of fte corresponding regton of 

. . i_ „ ~p Tjpon iR-5f Use of this STS marker also 

normal chromosome 10 within the genome of BE2C1-18 5t. use 01 

10 demonstrated that the mardel 10 chromosome has originated from the patient's father. 



EXAMPLE 3 
Antisera 



,5 Antiserum CREST #6 was from a patient with calcinosis, Eaynaud's phenomenon, esophageal 
dysmotility, sclerodactyly and ^langiectasia (a constellation of symptoms commonly referred to 
as "CREST"; Moroi ei at. 1981; Fritzler and Kinsella, 1980; Brenner et al, 1981). Western blot 
analysis of this antiserum indicated that the primary antigens detected were human CENP-A and 
CENP-B. A specific anti-CENP-C polyclonal antibody, designated Am-Cl, was produced by the 

20 inventors by expressing a partial rr^use CENP-C polypeptide to 345) as a GST- 

fusion product in E, coli. followed by gel purification of the product and its use as an anUgen for 
antibody production in rabbit. 

EXAMPLE 4 

25 Preparation of Standard Metaphase Chromosomes for FISH analysis 

Actively replicating transformed lymphoblasts were incubated at 37*C for 17 h in the presence 
ofO iMilnalconcentnition of thymidine before they were centrifuged at 2000 rpm for 10 mm. 
washed with pre-warmed RPMI. and incubated for a further 5-6 h. 15 min before harvestmg, 
30 colcemid (lO^g/ml) was added. Cells were harvested according to standard cytogenetic 
techniques using 0.075M KC1 hypotonic solution for 15 min at 37°C, followed by three fi*auve 



26 



washes in ice cold me-tanol/acetic acid 3:1, dropped omo clean glass sUdes, aad s.ored 
dessicated at -20°C until required. 

EXAMPLE 5 

5 e«para.io„ .rMechaaicaU, StreKhed Chromosomes for ANTI-CEN/FISH Mapping 

METHOD - 1 

This is an adaptation of the method described by Page et al (1995). Colcemid (lO^g/ml) was 
added to actively dividing transformed lymphoblasts for 2-3 h. before the cells were centnfuged 
10 at 1500 rpm for 10 min, washed in PBS, and resuspended in 0.075M KC1 hypotonic solution 
for 10 min at RT at a concentration of approximately 5x10* cells/ml; the use of fewer cells 
here gave better stretching of the chromosomes. 200-300ul of this suspension were then 
cytccentrifuged onto clean microscope slides using a Cytospin 2 (Shandon) at 1000 rpm for 5 
min at high acceleration. The slides were immediately removed, placed flat in a shallow d 1S h 
15 and very gently flooded with KCM (Potassium Chromosome Medium:120 mM KC1. 20mM 
Nacl. lOmM Tris-HCl, 0.5mM Na 2 EDTA, 0.1% v/v Triton X-100) (Jeppesen et al. 1992). 
After 10 min at RT, immunofluorescence was performed without fixation (Earnshaw and 
Migeon, 1985; Earnshaw et al. 1989; Jeppesen et al. 1992; Jeppesen and Turner. 1993). KCM 
buffer was gently aspirated and 50ul of CREST#6 serum [diluted 1:50 in 1 x TEEN (ImM 
20 Triethanolamine HC1, 0.2 mM Na 2 EDTA. 25 mM .NaCl). 0.1% v/v Triton X-100, 0.1% w/v 
BSAJ was added to the cell area of the slide and covered with a parafilm coverslip. The shdes 
were incubated for 30 min at 37°C, then washed very gently by flooding in 1 x KB" [10 mM 
Tris-HCl (pH7.7), 0.15M NaCl, 0.1% w/v BSA), three rinses of 3 min each at RT. The primary 
antibody was detected with Texas Red-conjugated Affini-pure Rabbit anti-Human IgG (H&L) 
25 (Jackson Laboratories) diluted 1:50 in I x KB". 50 ul was added to each slide, covered with a 
parafilm coverslip, and incubated for 30 min at 37-C. The slides were again gently washed by 
flooding in 1 x KB" for 2 min at RT, before they were fixed by flooding in 10% v/v formalin 
in KCM for 10 min at RT. followed by three rinses of 3 min each in distilled water. If FISH 
was not performed the slides were rinsed in PBS and mounted in DAP1 (0.25 ug/ml) in 
30 DABCO antifade mountant. [In experiments where CREST#6 and Am-Cl antisera were 
simultaneously used to label the centromere (Figs. 2B and C), the above procedure was 
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ci antibody w« detected using 1=100 di>u,ed Donkey anti-Rabbi. DTAF (Jackson 

Laboratories)]. 

5 if PISH -as to be performed on the slides, .hey were .hen given a second fix in 3:1 
met han„,aceuc acid for .5 min a. RT. The s.ides were air dried for a, ieas. 5 rran anc . -h- 
processed for FISH or s«ored « -20'C for up «o several days before continumg. For FISH, .he 
sUd cs were dehydra.ed a, RT in 70%. 90%. 100% v* etna*,, (2 min each, an « 
Chromosoma. DNA was denary in demised 70% v/v ,ormamide/2xSSC. pH 7.0 « 82 C 
■ ,0 for 8 min foilowed by immediaK dehydration in 70%. 90% and 100% v/v ethano. at -20 C for 
2 min each, men air dried for at leas. 10 min. (This high .emperature of den— was 
critica! to oWain maximum FISH signals). An amount of 15 ul of the prepared probe was 
added ,0 each slide, covered with a 22mm* covers.*, and sealed with rubber cement. Sltdes 
were hybridized overnight in a humid chamber a. 3TC men rinsed in 2 x SSC at RT, followed 
15 by 3 washes of 0.1 x SSC a, 60-C for 5 min each, rinsed again in 2 * SSC. and immersed ,„ a 
blocking agent of 5% non fat mil* in 4 x SSC for 10 min at RT. Probe hybridization was 
detected by incubation with FITC-conjugated avidin a, 37-C for 30 min. Mowed by three 
washes of 5 min each at RT in wash buffer (4 x SSC. 0.05% v,v Tween-20). Signals were 
amplified by incubating with goa. anti-avidin D antibodies for 30 min a. STC followed by 
20 three washes of 5 min each at RT in wash buffer, men with another layer of avidm-FITC for 
30 min a. 37°C, before the slides were washed in wash buffer, rinsed in PBS. and coun.er- 
stained with DAP1 (0.25 ug/ml) in DABCO mountant. 

METHOD - n 

" The following method was modified from mat of Haaf and Ward. (1994). Actively dividing 
lymphoblast cells were tieau* wit), lOug/ml colcemid for 2-3h. washed in PBS and 
suspended in a hypotonic solution consisting of !0mM Hepes ( P H7.3). 30mM glycerol, 
1 OmM Cad, and 0.8mM MrfV at a ceU density of approx. 2.5 x lOVml. After 10 mm of 

30 hypotonic treatment a. RT, 300 Ml «re cytocentrifuged (Shandon - Cytospin 2) onto glass 
slides a, 800 rpm for 4 min. The slide, were immediately removed from the centrifuge, dned 
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Ior ,5sec. fixed in — at-20'C for 20-30 min. rinsed *^«™ 
then washed in 3 rinses of PBS at RT. Immunofluorescence staining was done using CREST#6 
at a dilution of 1 :50 in PBS. After incubation at 37»C for 30 min, the slides were washed three 
,mes in PBS for 2 min each. This primary anUbody was then detected by a further incubatu>n 
5 for 30 min at 3VC with Texas Red-conjugated Rabbit anti-Human IgG diluted at 1:50 m PBS. 
The slides were fixed in 10% v/v formalin in KCM for 10 min at RT. then washed in 3 nnses 
of distilled water and drained. Before FISH was performed, slides were fixed in methanol 
acid 3:1 for 15 min at RT and air dried. Chromosomal DNA was denatured in 70% v/v 
deionised formamide (pH7.0) in 2 x SSC at 82°C for 4-6 min. After dehydration in an ic cold 
' 10 ethanol series the slides were air dried, and used for FISH as described for Method I. Slides 
could be stored covered in foil at RT after methanol/acetic acid fix for up to several weeks 
before FISH. 



Both 

15 



methods I and U were used to obtain the results shown in Figs. 2B. 2C. 3 and 4B. 



EXAMPLE 6 
Image Analysis 



Hybridization signals for YAC mapping on standard metaphase preparations utilized a normal 
20 fluorescence microscope. Images for the ANTI-CEN/FISH experiments were analyzed on a 
Zeiss Axiolab fluorescence microscope equipped with a lOOx objective and a cooled CQD 
camera (Photometries Image Point) controlled by a Power Mac computer. Gray scale images 
were captured separately using a LUDL filter wheel and controller for Texas Red. FrTC and 
DAPI These images were pseudocoloured and merged using IPlab Spectrum software from 
25 Signal Analytics Corporation. A number of difficulties were commonly associated with the 
ANTI-CEN/FISH technique: (a) the deliberate "stretching" of the chromosomes, whilst 
increasing the resolution of mapping, sometimes caused serious distortion to the chromosomes, 
often making them quite dysmorphic; (b) FISH treatment following the ANTI-CEN-labelling 
often significantly reduced the ANTI-CEN signals; (c) more highly stretched chromosomes 
30 (which would potentially give better mapping resolution) generally gave weaker ANTI-CEN 
signals; and (d) the ANTI-CEN signal on the mardel 10 centromere was usually weaker than 
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those of the other human ch.omoso.ne, Thus, a cell would only be considered informal an 
used for scoring if bo* the p'- and q'-arms of .he marde. 10 chromosome were discen,.ble and 
seated by a disc** ANTI-CEN signal. In addition, FISH signals for both the test probe and 
the 10PC38 cosmid tag (used to identify the ,'-arm of. and thus orientate, the marker 
5 chromosome) must be clearly present. Using these criteria, the overaU frequency of informal 
cells was found to be approximately 1 in every 20-30 metaphases analyzed. 



EXAMPLE 7 
Restriction Analysis of Patient DNA 



10 



High-molecular weight genomic DNA was extracted from cultured fibroblast cell lines of 
patient BE and those of his parents and digested with different enzymes to generate restncUon 
fragments ranging from <lkb up to -1 Mb. The digested DNA was resolved either on a 
standard agarose gel or by pulsed-field gel electrophoresis (PFGE) using a Bio-Rad CHEF-XA 
15 Mapper. For filter hybridization, 50-100 ng of whole cosmid or PAC DNA was labelled by 
random priming. The labelled probe was then added to 2 ml of hybridization buffer (0.5M 
Na HPQ 4 7% w/v SDS. 1% w/v BSA, ImM EDTA, P H. 7.0) containing 500 ug of human 
placental DNA (Sigma). The mixture was boiled for 5 min, then placed in a 65°C water bath 
for preannealing of repetitive DNA for 90 min. The preannealed probe mix was then added to 
20 prehybridizing filters and hybridized overnight at 65°C. Post-hybridization washes were at a 
final stringency of 0.1 x SSC, 0.1% w/v SDS at 68°C. 

EXAMPLE 8 

Identification of a YAC region spanning the marker centromere 

^ The initial search for DNA sequences spanning the centromere of the mardel 10 chromosome 
was based on fluorescence in situ hybridization (FISH) of existing cosmid and YAC clones 
(Moir et al. 1994; Zheng et al, 1994) that have been mapped to the q24 - q26 region of the 
normal human chromosome 10 where the new marker centromere was formed (Voullaire et al, 

30 1993) (Fig. 1 A). This search led to the identification of a 4 megabase YAC contig (designated 
#082) that spanned the marker centromere region (Fig IB). Fig. 1C graphically presents the 
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FISH mapping results with selected YACs from this contig. As can be seen, two of the YACs 
(YACs-1 and YAC-2) mapped to the q'-side of the marker centromere, whereas the remammg 
YACs mapped to the p'-side of the centromere. The low signal level observed for YAC-3 was 
due to a large proportion of this probe hybridising directly on the centromere itself. These 
5 results therefore, provided evidence that YAC contig #082 spanned the marker centromere, and 
that the centromere region was likely to be within YAC-3. where the "cross-over" between the 
q' and p' signals occurred. 



1Q EXAMPLE 9 



Development of Improved ANTI-CEN/FISH Methods for the Simultaneous 
Detection of Marker Centromere and Single-copy Cosmid DNA Probes 

Although normal fluorescence microscopy and FISH analysis of standard metaphase 
15 chromosomes were adequate for the initial identification of the YAC contig spanning the marker 
centromere, methods with significantly higher sensitivity and resolution were needed to allow 
further walking into the marker centromere DNA. Three requirements have to be satisfied by 
these methods: (a) the metaphase chromosomes have to be extended to offer much greater 
mapping resolution, (b) the centromeres have to be more precisely defined than that offered by 
20 a cytogenetic constriction, and (c) the methods should allow simultaneous visualization of both 
the centromere antibody and FISH signal. Two published methods were explored (designated 
here as ANTI-CEN/FISH methods) based on extending metaphase chromosomes by mechanical 
stretching and labelling of the neocentromere by autoimmune antibodies (Haaf and Ward, 1994; 
Page et al. 1995). Since these methods were originally established for the labelling of nonnal 
25 centromeres and for FISH analysis of highly repeated DNA. they were modified (see Example 
4) to allow detection of the generally reduced ANTI-CEN signal of the subject marker 
neocentromere and the lower FISH signals resulting from the use of single-copy cosmid DNA 
probes. 

30 With the improved detection methods, the status of a-sateUite and satellite III DNA on the 
marker neocentromere was reassessed, since this was previously determined using standard 
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^oscopy and FISH (Voullaire aL 1993). Fig- 2A shows the result of anu^y 

on the marker centromere. The same result was obtained when the experiments wcrerepeated 
without ANTI-CEN-labelUng. ruling out the possibility that the anti-centromere anUbody might 
5 have obscured any weak FISH signal, Similar results were obtained with satelltle HI WK. 
Since in separate reconstruction experiments, it was possible to demonstrate the sens.uv.ty of 
lhe procedure in detecting a smgle-copy DNA probe of less than 1.5 kb. and makmg the 
enable assumpuon that the low-stringency hybridization conditions used for the a-satelhte 

' 10 hybridisation of these probes to all the other centromeres, .ould have allowed the detectton of 
any related sequences, it can be concluded that these satellite are absent. 

EXAMPLE 10 

Co-localization of CENP-C and CENP-A on the Marker neocentromere 

15 To test if CENP-C is present on the marker centromere, a specific rabbit polyclonal antibody was 
prepared against arecombinant product of mouse CENP-C. This antibody, designated Am-Cl. 
reacted strongly with the centromeres of rodent and human chromosomes. Fig. 2B shows results 
for the labelling of stretch* human metaphase chromosomes using this antibody simultaneously 
20 with the CREST#6 autoimmune antibody. As can be seen, irrespective of the degree of 
chromosome stretching, the signals for the two antibodies coincided fully on all the centromeres. 
The localization of these two antibodies on the marker chromosome was further determined by 
employing the 10pC38 cosmid tag in an ANTI-CEN/FISH experiment to identify the marker 
chromosome. The results indicated that both the antibody signals were clearly present and agam 
25 coincided completely on the marker centromere (Fig. 2C. a-e). Although CREST #6 was known 
to bind CENP-A and CENP-B, indirect evidence suggests that binding to the marker centromere 
presumably occurred via CENP-A since the presence of the marker centromere was previously 
demonstrated no, to bind CENP-B (Voullaire e< al, 1993). The above results, therefore, 
established the localization of CENP-C, and probably CENP-A, on the marker centromere. 

30 
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EXAMPLE 11 

Location of the anti-centromere anuWy-binding domain 

For funher waiting M> d. marker regkn, cosmid libraries were prepared from total 

, yeast genomic DNA containing YACs-2, -3. -4. -6. -7. -13. and - .7. Cosmid clones contarnmg 
LI DNA inserts wore Isolated b, hybridization with human COT-. DNA using low 
Agency. AD toting cosmids were screened by standard FISH to confirm fteir —on 
to the expected madcer centromere and norma, chromosome .0 regions, and to el.rn.nate Cones 
ft. might have originated from other genonnc sites due to chimeric YACs. Positive c,o„es were 
,0 then ana.yzed tafter with the ANTI-CEN/FISH methods, using OBS» » «* * 
centromere. Fig. 3. (I and II) show examples of cosmid signals that mapped to the , - and p - 
side respectively, of the maxker centromere in the ANTI-CEN/FISH experiment, The cosmrd 
t ag'(clone 10pO8) was used in these extents to defute ft. * arm of the marker 

15 mapping of YAC contig *082 indicated that the marker centromere region was likely to be * lu ^ n 
mis YAC Fig. 4a shows a restriction map of the region covered by this and surrounding YACs 
and compares mis map with a genomic map derived from patient BE. The relative positions of 
a series of cosmid clones (incWing Sve indent PACs) were also determined and placed on 
the YAC map. Kg. 4i> presents the ANTI-CEN/FISH results obtained with a number of the 
20 cosmid clones and one of the PAC clones. Clones Y3C64. Y6C8, and Y3C94 IccaUzed 
preferential to the q'-side, while Y13C1+C8 and Y17C6 localized preferentially to the p'-stde 
of the marker centromere, suggesting mat the nucieus of the antibody-binding domain is situated 
between these two cosmid clusters. Within mis central region, a group of cosnud clones 
comprising the HOonug (Fig. 4*) was found to map closely around the ANTI-CEN signal Fig. 
25 4c shows a restriction map for eight different overlapping clones from this HC-conttg. The 
chrcmosomal positions of five of these overlapping clones were analyzed in detail using ANTI- 
CEN/FISH Kg. 46 shows the cumulative results for more than 60 informative chromosomes for 
each of these five probes. The results indicated that Y7C14 mapped preferentially of the 
antibody-binding don* while the remaining four clones (Y4C45. Y6C10. Y6C2. and Y3C3) 
30 mapped preferentially to the p'-side. In addition, the results for PACS (a 75 kb-insert PAC clone 
mat overlapped with the p'-end of PAC4 by approximately 5 kb; see Fig. 4a) provided further 
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evidence for the emergence of the HC^ontig region onto the p'-arm. Based on these results, we 
conclude that the eight contiguous cosmid clones within the HC-contig shown in Fig. 4c, which 
together constitute -80 kbp of DNA, have defined the nucleus of the antibody-binding domain 
of the marker centromere. 

5 

From the above ANTI-CEN/FISH results, it was difficult to determine if the sequences of the 
HC-contig and its surrounding DNA, both originally derived from a normal individual, were part 
of the marker centromere DNA, or whether these sequences simply flanked a transposed 
centromere DNA with an unrelated nucleotide composition. However, supporting evidence from 

10 the ANTI-CEN/FISH experiments suggested that the DNA of the HC-contig region appeared 
to be a part of the marker centromere. This came from the mapping of Y6C10 and Y6C21 onto 
superstretched chromosomes thai were occasionally detected in the slide preparations. An 
example of such mapping is shown in Fig. 3b using Y6C21. As can be seen, whilst a significant 
portion of Y6C21 hybridized to the p'-side of the CREST signal on the highly extended 

15 chromosome, a substantial portion of the cosmid DNA also overlapped directly with the CREST 
signal. This suggests that at least part of the HC-contig region actually comprises the same DNA 
sequence as the marker centromere. This possibility was further investigated by detailed genomic 
mapping. 

20 EXAMPLE 12 

The Marker Centromere DNA has a Similar or Identical Sequence 
Organization as the HC-Contig 

The genomic organization of the HC-contig region was compared with that of the 
25 corresponding DNA region of the mardel (10) chromosome. Three overlapping cosmids 
(Y7C14, Y6C10, and Y4C7, the latter being essentially the same as Y6C21; Fig. 4C) from the 
HC-contig were used as probes to analyze the restriction patterns of genomic DNA prepared 
from patient BE and those of his karyotypically normal parents. Fig. 5 shows examples of the 
band patterns obtained with Y6C10, while Table 1 summarizes the results for all the enzymes 
30 tested with Y7C14, Y6C10 and Y4C7. The detection of a single band on PFGE gels with a 
number of the enzymes indicated that the cosmid DNA sequences were unique within the 
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human genome (5/n, &«. Kspl, Kpnl and Bctl in Fig. 5A; Table 1). The detection of a single 
on PFGE gels with a number of the enzymes (C/oI in Fig. 5A; Table 1 ) could be explained by 
differential mediation of different restriction sites found in this region (Nelson and 
McClelland, 1991); the reproducibility of these multiple band patterns ruled out mcomplete 

5 digestion as a possible cause. The multiple bands detected with the more frequent cutting 
enzymes on a standard gel (Fig. 5B and Table 1 ) were a result of the presence of cleavage sues 
present within the probe DNA, since similarly digested cosmid DNA electrophoresed next to 
the genomic DNA yielded identical patterns for all the bands not containing cosmid vector 
sequences. In all, 37 enzymes were used to generate more than 160 different fragments for the 

10 three cosmid probes (Table 1). The results indicated that, except for a polymorphic fragment 
found in one of the parents, an identical banding pattern was present in the genomic DNA of 
patient BE and those of his parents. Furthermore, when the restriction patterns obtained for the 
genomic DNA of patient BE were compared with those of the smatic hybrid cell line BE2C.1- 
18-5F, which contained the marker chromosome but not the normal chromosome 10, no 

15 detectable difference was seen between the two DNA preparations within the HC-contig region 
(Fig. 5C). 

In addition to Y7C14, Y6C10 and Y4C7. a host of other probes from within or surrounding the 
HC-contig have been tested, each with an average of 12 different informative enzymes. These 

20 probes included PAC4 (which spanned the entire HC-contig region shown in Fig. 4C), Y3C64, 
Y3C109, Y6C6, Y6C8. Y3C94, PAC1, Y3C90, Y4C4, Y4C8, Y4C13, and Y3C33. The results 
again indicated identical restriction enzyme patterns between patient BE and normal DNA. 
Thus, through the analysis of a relatively large number of probes covering about 500 kb of 
YAC-3 around the HC-contig region, and the use of a high density of restriction enzymes that 

25 generated a range of fragments from <l kb to -1 Mb, it was evident that the marker centromere 
DNA and a substantial stretch of its adjoining regions showed no detectable difference against 
the corresponding genomic region of the normal chromosome 10. 

Since a potential limitation of the above Southern blot analyses was that highly repeated 
30 sequences were not detected because of the prcanncaling step used in the hybridisation 
procedure, a different approach was employed to compare the DNA of the marker chromosome 
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and tat of the norma! chromosome 10. In this approach. oligonucleotide primers from different 
regions of the HC-contig were used to prepare a series of PCR fragments from the BE2C1-18- 
5F and BE2C1-I8-1F hybrid cell lines. Electrophoretic comparison of such fragments, whrch 
random., covered approximately 40 kb of the HC-contig. indicated no detecutble difference 
5 between the two chromosomes *td provided independent support for the results obtained . the 
Southern blot analyses. Titus, it can be concluded that the sequence organization of the marker 
centromere region is similar, if no. identical. ,0 that found in the HC-contig region of the 
normal chromosome 10. 



10 

EXAMPLE 13 



Implications for Centromere Study and Mammalian 
Artificial Chromosome Construction 

15 The rnamrnalian centromere has been difficult to study due to the massive amount of repetitive 
DNA normally associated with it. By avoiding such repetitive DNA and analyzing the unusual 
centromere found in the present marker chromosome, the inventors have created a much more 
tractable system for centromere studies. The present analysis has already shed some light on the 
important question of DNA sequence versus conformational requirement of a centromere, and 
20 on the intriguing concepts of latent centromeres and epigenetic mechanisms. One urgent 
application of this DNA is to use it to identify the primary protein(s) which binds to the 
centromeric DNA. Another important application of the marker centromere DNA is in the 
construction of marnmaUan artificial chromosomes. Such artificial chromosomes offer a 
potentially powerful vehicle for the structural and functional analysis of chromosomes, for the 
25 genetic manipulation of plants and animals, and for the stable transmission of therapeutic genes 
in human gene therapy. The artificial chromosomes require a functional mammalian centromere, 
and the marker centromere DNA element of the present invention now provides a suitable 
centromere especially because of its relatively small size in the absence of ct-satellite DNA and 
its cloning stability, as indicated by the cosmid, YAK and B AC clones of the HC-contig and NC- 
30 contig. 
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EXAMPLE 14 
Sequence analysis 

Figures 6 16A and 16B show partial nucleotide sequences for the HC-contig (SEQ ID NO: 3) 
5 NC-contig [SEQ ID NO: 4] and F2 (BAC/F2-14) [SEQ ED NO: 5-29] regions, respectively. 

EXAMPLE 15 
Human Artificial Chromosome (HAC) 

10 The following are examples of the different approaches being used in the inventors' laboratory 
for the production of a HAC: 

TTrtriTfmiiTff ftf Hr-*tP«ff T)NA from normal chromoiojne_lfl 

15 This procedure aims to produce HACs of 100 kb to >lMb using the region of the normal 
chromosome 10 containing and surrounding the HC-contig DNA. The generation of a HAC by 
this approach will provide crucial proof that this normal DNA region can be reactivated to form 
a functional centromere. 

20 A retrofitting procedure suitable for introducing human telomeres to both ends of any YAC 
prepared in the pYAC4 vector in the yeast host strain AB1380 has been previously described 
(Larin el al, 1994; Taylor * al. 1994. 1996). YACs (in particular YAC-3 and YAC-5) spanning 
the normal HC-contig region are used for retrofitting by plasmid constructs designed to 
recombine with their P YAC4 vector arms (Figure 7). The construct pLGTEL 1 is used to target 

25 the left arms of the YACs. This serves to add a LYS2 yeast selectable marker, gpt element for 
ultimate selection in mammalian and avian cell culture, and a human telomere. The right arm of 
the YACs are targeted by homologous recombination with pRANT 11 to produce a final 
construct where additional markers are introduced along with a second human telomere to cap 
the construct. Specifically, an ADE2 yeast marker is added and the URA3 gene of the YAC is 

30 disrupted, serving a useful role in negative selection of the construct. A neomycin (neo) 
resistance gene shown to function in mammalian and avian cells is also introduced. The finished 
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constructs are transfected into different cultured cell lines, including HT1080 (of human sarcoma 
origin) (Larin et aL. 1994; Rasheed et al. 1974), DT40 (a recombination-proficient clucken ceU 
line) (Dieken et al. 1996), and BE2CM8-5f (a human/hamster somatic hybrid cell line 
containing the mardel (10) chromosome but not the normal chromosome 10). 

5 

/ 0 _ K to^ toninc nf Hf-rf pi nn into YAfTHAC vectors 

The different vectors used for the cloning of the normal and mardel (10) centromeric DNA in the 
preparation of HACs are summarised in Table 2. 

10 

A number of different YAC cloning strategies are employed: 

Conventional YAC cloning approach. Figures 8A-D show the different vectors used for 
cloning DNA as YACs by the conventional restriction/ligation methods. These YACs can then 
1 5 be shuttled into mammalian cells and tested for HAC function. 

ALU-ALU circular TAR cloning approach. Transformation-associated recombination (TAR) 
in the yeast S. cerevisiae, is a method for constructing linear and circular YACs from rnamrnahan 
DNA (Larionov et al. 1996a, 1996b). The recombination process is shown in Figure 9. Briefly, 
20 the technique involves the use of a vector (pVC39-AAH2, Fig. 8E) lacking an autonomous 
replicating sequence (ARS) but containing a functional yeast centromere (e.g. CEN6) and 
selectable marker (e.g. HIS3), and two ALU DNA hooks to trap rnatnmalian DNA by 
recombination at ALU sequences after co-transformation of linearized vector and high molecular 
weight DNA into yeast spheroplasts and followed by selection on medium lacking histidine. The 
25 key to the process is that the mammalian DNA provides an ARS (ll-b P sequence found 
frequently in mammalian DNA) which allows the HISVCEN vector to replicate as a circular 
YAC. These YACs are very stable and range in size from 100 kb to greater than 600 kb 
(Larionov et al, 1996b). 

30 pVC39-AAH2 vector is used to clone DNA from hybrid BE2CI-18-5f to make YACs with an 
average insert of 250 kb. This TAR vector is further modified to create pAAH-TCNa (Fig. 8G) 
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so that it has the Mty to shut* bemeen yeast and mammalian cells (as outlined in Figure 10). 
»»*• the t~m » expos, hun-a» .Co™- (TEL) at each end of a clo-ed fragment «, 

a unique restriction site I-Scel. 

5 Semi-specific on* specific circular TAR. A modified circular TAR method utilising two 
specific 5<C and 3'C DNA hooks (300-700 bp in size) may be used to clone a specific human 
DNA at a frequency of 3/1000 HIS* transform^. The inventors prepared the vectors P VC39- 
ALU/C3-F2( + /-) and P TCN-TCS (Table 2) to perform semi-specific and specific TAR cloning. 

respectively. 

1 ° The Semi-specific TAR methodology is a modification of a specific circular TAR strategy which 
permits the site directed isolation of target chromosomal DNA. Furthermore, in accordance with 
the preset invention, the methodology described herein enables the site-specific cloning of target 
chromosomal DNA from total genomic DNA as a circular YAC at relatively high frequences and 

15 without the need for the construction and extensive screening of complex libraries made from 
genomic DNA. 

In a preferred embodiment of the present invention, the methodology employs a single specific 
DNA hook which flanks the mardel (10) chromosome and a less specific Alu-hook to trap the 
20 other side of the target DNA. 

In initial experiments, a unique repeat DNA-free. 1 .4kb EcoRI fragment (designated C3-F2) was 
identified from the p< side of the 80-kb HC-contig (Fig. 11 A) (du Sart et al, 1997). Thxs 
fragment was subclone* into the centxomere-based yeast circular TAR vector, P VC39-AAH2. 
25 by replacing the existing BLUR13 Alu (Larionov etal, 1996b) to create the pVC39-ALU/C3-F2 
constructs. As the specific orientation of the C3-F2 sequence on the chromosome was not 
known, the fragment was cloned in two different orientations, for which the ( + ) orientation (Fig. 
1 IB) was expected to trap the genomic region to the left of C3-F2. while the (-) orientation was 
expected to trap the region to the right. Both constructs were used in yeast transformation. 

30 

As a source of genomic DNA containing the neo^entromere, a somatic hybrid cell line, BE2C1- 
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1 Wf (du Sart et oL 1997), containing the mardel 10 chromosome but not the normal human 
chromosome 10 was used. 5ug of high-molecular-weight DNA from this cell line and lug of 
p VC39-ALU/C3-F2(+) or pVC39-Alu/C3-F2(-) (linearized with Sm* to expose the 0.21-kb Ala 
and 1 4-kb C3-F2 hooks) were co-transformed into 10' (previously prepared and stored frozen) 
5 spheroplasts of S. cerevisiae YPH857 which carries a HIS3 gene deletion, (Sikorski and Hieter, 
1989) and grown on SD, without HIS medium, (Larionov et aU 1996a;b) to yield between 10 
and 100 HIS* colonies. Control experiments in which YPH857 was transformed with vector 
alone did not produce any colonies, indicating that the C3-F2 fragment lacked ARS-hke 
sequence, Twenty TAR experiments were performed and HIS* colonies were picked into 96- 
10 well trays containing YPD medium (supplemented with 50ug/ml ampicillin and 15ug/ml 
tetracycline), grown at 30»C with aeration for 24h and stored in 20% (v/v) glycerol at -70«C. 
Total yeast DNA was prepared in pools of 48 (Kwiatkowski jr et al, 1990) and screened by 
PGR with the primers norm 5 and norm 7 (Table 3) which are located 30-kb q' of C3-F2 (F lg . 
UA) Two desired positive clones, designated 5f-52-E8 and 5f-38-F2, which contained the neo- 
15 centromere DNA derived from mardel 10 and mardel (10) and the DNA immediately p' of the 
neocentromeric DNA, respectively, were identified. For subsequent studies, these clones were 
grown on SD without HIS medium and single colonies were re-isolated for characterization. 

Initially, the sequence nature and sizes of the 5f-52-E8 and 5f-38-F2 insert DNA were 
20 determined. High-molecular-weight DNA was prepared in agarose blocks and digested with an 
enzyme (Srfl) that linearized with YAC (Fig. 1 1 A). The linearized DNA, as well as uncut intact 
DNA. were resolved by pulsed-field gel electrophoresis (PFGE), transferred onto a nylon 
membrane and probed with radiolabelled PAC4, a Pl-derived artificial chromosome clone 
containing a 120-kb insert that spans the entire HC<ontig from normal chromosome 10, (du Sart 
25 et al., 1997) following preanneaUng with human placental DNA to suppress repetitive DNA. 
The intact 5f-52-E8 and 5f-38-F2 remained trapped in the electrophoretic wells and the 
linearized DNA migrated into the gel and demonstrated a size of approximately 1 10 kbp and 80 
kbp, suggesting insert sizes of about 105 kbp and 75 kbp, respectively (given that the vector saze 
is 5.9 kb). 

30 

Despite the use of a genomic DNA source previously shown by sequence-tag-site (STS) analysis 
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to be free from normal chromosome 10 material, it is desirable to independently confirm the 
mardel (10) -origin of the 5f-52-E8 YAC clone. This was achieved using a set of primers (norm 
17 and 18; Fig. 1 1A) that detected a variable-number-tandem repeat (VNTR) region within the 
HC-contig/neocentromere region. The results clearly indicated the presence of a 1.4-kb PCR 
5 product that was specific for the mardel (10) chromosome (Table 3). 

PCR was used to further compare the 5f-52-E8 DNA with the previously cloned HC-contig 
sequence derived from normal chromosome 10. PCR products with sizes ranging between 0.2 
and 15.9 kb were generated by standard PCR or with the Expand Long Template PCR system 

10 (Boehringer-Manneheim). Products greater than 1 kb were digested with frequent cutting 
enzymes, Rsal and BsiXI, and their fingerprints were compared by agarose gel electrophoresis. 
The results, shown in Table 3, indicated the absence of any detectable difference between the 5f- 
52-E8 DNA and those of the corresponding regions of the normal chromosome 10 (in somatic 
cell hybrid BE2C1-18-10 and the neocentromere region of mardel (10) (in somatic cell hybrid 

15 BE2Cl-18-5f). These results also demonstrated that the YAC 5f-52-E8 spanned at least 75 kb 
of the HC-contig region (Fig. 1 1Q, consistent with the size determined by PFGE. Furthermore, 
the ability of all the internal primers to amplify DNA from 5f-52-E8 strongly suggested that the 
YAC was not chimeric. This result was confirmed by isolating DNA from four single-colony 
isolates of 5f-52-E8, digesting these with EcoW and EcoRV, and probing with radiolabeled 

20 PAC4. The hybridization patterns obtained with these enzymes were consistent with those 
established in the previous study (du Sart et aL, 1997). Thus, this analysis, based on cloned 
DNA derived directly from mardel 10, has provided confirmation that the neocentromere DNA 
region is structurally identical to that of the corresponding HC-contig region of the normal 
chromosome 10 (du Sart etal, 1997). 

25 

The circular YACs 5f-52-E8 and 5f-38-F2 were further retrofitted with the yeast-bacterial- 
mammalian cells shuttle vector BRV1 as previously described (Larionov et aL. 1997). The 
resulting BAC clones were designated BAC/E8-1 and BAC/F2-14, respectively (Fig. 11D). 

30 The specific TAR strategy is oudined in Figure 12 and uses unique fragments from the HC- 
contig region, such as the ends of PAC4 (a 120 kb-insert PAC clone containing the HC-region) 
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t0 create the YAC/HAC shutue vector pTCN-TCS. An «amp.e of a YAC/HAC construe, 
coining the HC-contig region of normal '0 * *« >" F '6 13 ' 

Completed constructs arc transfected into different cultured marrunalian or chicken ceUs (see 
5 above) by lipofection using Traasfectam or DOSPER. 

, n ■v^in ^ " of Hr- rcfion i nto HftC vectors 

This strategy envoys a Unique Known as Telomere Associated Chromosomal Truncation 
,0 (TACT, (Fig. i4>. The technique is based on the principle that cloned mammalian teiomenc 
DNA when reintroduced into a mammalian cell can seed the formation of a new telomere a. an 
intrachromosomal location. U the introduced tefcmeric DNA is targeted to a known site through 
homologous recombination, integration a. that location and subsequent truncation of d-U 
sequencesonuteongWchorr.^sor^laxmcanresu.tffrown^^. 1994; Faxr *, al, 1995). 
,3 •nus^chmqueis.mp.oyedinourownsnrdy.otnrncatethemarde! 10 chromosome onenher 
side of me HC-contigfcore centromeric DNA element to produce in vivo a stable HAC of 
minimal size. 

Figure 15A shows an example of TACT-constmct used in our study. Key features of this 

20 construct are: (a) Cloning of the pericenti* human genomic DNA in both orientations (*-). Thrs 
1 s»ecess^s i ncewedon« 1 = M wt lB chromosoma 1 orie„uuonoftoDNA.TmsDNA.su S e < l 

,o target the human telomeric sequences .0 locations on either side of the HC-comig region on 
mardel 10. Genomic DNA is derived from several different sources including Y2C24. Y3C64. 
Y3C109 Y3C94. Y13C12, Y13C15, Y17C6, Y17C8. The resulting truncation denvauves 

25 produced using these genomic DNAs will vary in size accordingly, (b) The termini contain 2 4 
Abases of tandem repeat human telomeric DNA 0*0. This DNA has been shown previous., 
,„ « as a subsume for mammalian telomerase to allow seeding of a complete telomere tens of 
kilobases in lengu, (c) The hygroma (Hyg) resistance gene allows for positive setection of 
mammalian cell lines containing construct sequences integrated into the genome. Tins ts the 

30 initial screening procedure. In addition, some constats contain the neomycrn 
phosophotransferase gene (Neo) rather than Hyg. (c) The Herpes simplex thymidine kinase (TO 
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• . ™ v-nmnlopous integration events into the genome, 
gene is used for negative selection against non homologous miegra 

• • u tvt a * nt . ran be selected against by adding the nucleoside 
Those ceU lines containing the TK gene can be seiccico e / 

analogue gancyclovir. 

i ~t tapt construct used in our study. In addition to the 
5 Figure 1 5B shows another example of TACT-construct useo u» 

features of the lineaxised construct shown in Fig. 15 A, specific additional features are: (a) The 
incorporation of tandem telomeric blocks (htel.htel) since Others have shown these to have the 
highest seeding efficiency of new telomeres in mammalian cell, (b) The incorporation of yeast 
selectable marker (eg. URA3), DNA origin of replication (eg. ARS). and centromere (eg. 
10 CEN6) to ^«rtrio^of-.^«n^^ , - D »- tTto 
should facilitate further characterisation and manipulation, such as the introduction of therapeutic 
genes for gene therapy purpose, (c) The relocation of the TK gene adjacent to the genomic 
DNA to increase the effectiveness of the negative selection system, (d) The human growth 
hormone (GH) gene has been included to allow proof of principle that human genes can be 
1 5 introduced into a MAC and expressed under the control of endogenous regulatory elements. Tins 
is essential for gene therapy applications of the resulting HAC. (e) A CMV promoter upstream 
of a Pi phage loxP site (CMV/loxP) has been included to allow introduction of large human 
genes into a HAC in Wv*. A plasmid containing a gene of interest, a second loxP site and a 
promoters selectable marker gene is introduced into a mammalian cell line containing the 
20 HAC. Transient expression of CRE recombinase results in recombination between the two loxP 
sites within the cell, thereby integrating the introduced plasmid into the HAC and ptaong the 
selectable marker gene next to the CMV promoter to allow for marker selection. 

For chromosomal truncation, the above TACT-constxucts are transfected into a somatic cell 
25 hybrid line BE2CI-18.5f containing the mardel (10) chromosome. Positive selection is applied 
for Hygromycin or Geneticin resistance whereas negative selection is applied against the 
Thymidine Kinase Gene. Resulting colonies are further screened with distal p< and q< DNA 
fragments to ascertain the presence or absence of the two mardel 10 chromosome arms. In 
addition to the BE2CI-18-5f cell line, a human/chicken somatic cell hybrid line (derived from the 
30 recombination-proficient DT40 chicken cell line; Dieken et al, 1996) containing the mardel (10) 
chromosome will also be generated and used. 
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EXAMPLE 16 
Analysis of HAC 

respective of which of the approaches described above is used, the presence of a new product 
in a mammalian cell line as an extrachromosomal. artificial chromosome, will be assessed by 
fluorescence in situ hybridisation (FISH) analysis, as well as tested by extracting high molecular 
weight DNA to determine independently existing chromosomal entity on pulsed field gel. The 
stability of the construct through successive cell division, both in the presence and absence of 
drug-resistance selection, will be determined. The presence of the construct, in all or a fcgh 
10 percentage of the original transfected cells indicates stability. Demonstration of this stabmty 
indicates the successful creation of a HAC. 

EXAMPLE 17 
Production of HAC 

15 This example describes the use of the neocentromere as a source of centromeric DNA in the 
"bottom-up" approach to produce HACs in human cell culture. Bacterial artificial chromosomes 
(BACs) containing cloned neocentromeric DNA and a selectable marker were co-transfected 
with human telomeric DNA into human HT1080 cells to yield independent HACs that were 

20 single-copy and stable in the absence of selection. The properties of these HACs. and their 
potential utility as a new, improved vector system for gene therapy are described. 

EXPERIMENTAL PROTOCOL 

25 Preparation of DNA. Highly-purified BAC DNA was prepared using Qiagen columns according 
to the manufacturer's instructions. Prior to transfection. BACs were linearized with SgrAl in the 
presence of 2.5 mM spermidine and examined by pulsed-field gel electrophoresis. Human 
telomeric DNA was gel-purified as a 1.6-kb BamUVBgni fragment from P SXneo270T2AG3 
(Blanche */.. 1997). High-molecular-weight genomic DNA was prepared from cultured cell 

30 lines using standard methods (du San et al, 1997). 
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^74) was performed the OOPSER .p— — T « <~ 

r ,™ 6 well trays (each well is 962 mm 2 ) were seeded with 
. . x TVv > ^av before transfecuon, o-weli trays ^a^i* 

I N aCl) supplemented with 0.075 tnM spermidine and 0.030 spemune. These 
^1 werfnW with 50 m of 0.4 ^ OOPSER (diluted in HBS, and left a, room 
ZL for >5 to 20 * The rTTlOSO cells we, — with PBS (phosphate buffere 

lempcidiui^ p flB w medium^ was placed in each 

10 a l in e)and.nJof^n>n«DMEMO>.lb«o'smoa 1 fiedEaglesn,ed 1 um) P 

^ The DNA-DOPSER mixture was then added dropwise with swirling and the ceils were 

trays (each well is 100 mm 2 ) containing DMEM-10% . 
„ (C41S. Cibco-BRX> a, 250 ^ The msdU was changed ever, 3 ,0 4 day, 

gaining ceUs were analysed by fluorescence .» hybridization (FISH). 

20 CeU «*«. and .tafe HT1080 ceUs .« grown in DMEM suppled with 

W* v/v FCS. penicillin/slreptomycin. and gluumine. The mitotic stability ""^""^ 
cto „es was detennined by growth in 25 cm> flas* in the presence (200-250 ^ or absen e 
of G41S se.ec.ion. and grown to connuency (3. days, and split 1,5 and 1/10, respecuvel 

25 with BAC/E8 and/or BAC/F2 probes. 

nsa ANTI-CEN/FISH and PRINS/FISH. Fluorescence in siru hybridization (FISH) analysis 

. « - t i r, Ar /FR BAC/F2 and/or a-satellite DNA probes, 
of HT 1080 clones was performed with BAC/E8, BAL/r/, ana/ 

„ y hrtdizauon using the BAC probes were performed under high suingency whereas 
30 satellite DNA probes were used in .ow stringency condidons (du Sar. « aL 1997). ANT. 
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CEN/FISH analyses involved . inula, immunofluorescence staining step "^REST 
CEIWlsn J • t rrNP B CENP-C or CENP-E. followed by FISH using 

antibody or specific antibotes against CENP-B. CfcW L. 

the probes described above. essen.ia.ly as previously described (du San « at.. .997). 



5 Results 



HAC cnsttucdon strategy. The basic strategy involved ,he co-transfeotion of the 10,25.2 
neocentromere DNA with human telomeric DN A into human cells. The neocentromere r ^*°^ 
is cioned as two. circtuar VACS in Socles To facd.tate tart* -d 

10 purificadon of .be cloned DNA in large pities, these VACs are re.ofit.ed «. BAC and 
gained epUornaUy in* «* as circular molecu.es. One of me B AC Cones. BAOE8. » 120 

<F ig ,6). The seeond BAC Cone. BAC/F2, has an inser. size of 75 kb that overlapped B AOE8 
L I 4 kb and contains - .0 kb of the core NC-DN A while extending -65 kb in.0 Che p -s.de of 
,5 the marde. (10, chromosome (Fig. i6). The BAC vecor backbone further cc^nsthe 
oeomycin-resistauce (Ne.«) gene ,o auow selection in mammalian ceil, BAC/E8 and BAOF2. 
used either on ,heir own. in combination with each other or with additional DNA are usen in the 
following transfection experiments. 

20 Transfection of HT 108 0 cells. The human cell line HT1080 (Rasheed « n,.. 1974, is chosen 
for .he transfection av ^ S ^om^^^^^' l ' tKi ^ 
activity (Holt e, at., 1997,. and its demounted ability ,0 form microchromosomes contanung 
* novo cen.romeres from transfeced arrays of a-satellne DNA and human te.omenc DNA 
(Harrington e,oL, 1997; Dceno e, al. 1998). The resulting 0418-resis^n. clones are analysed 

25 by FISH and classified into different categories of events. 

Transtech cel. lines are designated HT-38. HM7. KT-54. HT-.90. and HT-191 . 

Those skU.ed in ft. art wdl appreciate tha. the invent described herein is suscepub.e to 
30 variation and modifications other than those specLficairy described. I. is to be understood that the 
invention indudes aU such variations and modifications. The invention also includes aU of the 




sl eps. features, ccnposiuons and compel .fen* .0 or indie- * «. 

features. 
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TABLE 1 

. nMinf oatient BE and those of his parents using three 
Restriction analysis of the genomic DNA of patient bc an 

overlapping cosmids that span the marker centromere. 



Y7C14 



Y6C10 



Y4C7 



NotI 


n.a. 


BssHlI 


n.a. 


BsiWI 


n.a. 


Sail 


410 


Clal 


315, 145,110, 80 


SnaBI 


n,a. 


Nael 


240,210, 155, 120 


Narl 


222, 108,70 


EclXI 


180 


Sfil 


170 


Kspl 


168 


Aatll 


165, 146 


Nhel 


38 


BstBI 


n.a. 


Smal 


n.a. 


Bgll 


25 


Pad 


n.a. 


BamHI 


24, 19, 1J 


Kpnl 




Bell 


21 


PsU 


9.4,5.9,5.1,4.2,3.8, 




3.3, 2.9. 2.4 


Xbal 


14 


Eael 


n.a. 


SphI 


16,7.5 


PvuII 


14,7.5 


Hindn 


8.6, 6.9, 6.2. 2.7.18 




1.2 


Apal 


15, 8.5 




11,4.3.3.9. 1.9, 1-5 


EcoRI 




Hpall 


5.5,4.3, 3.6. 1.6 



910 

815.340 

740 

410 

315. 145. 110,80 
250, 148 

240.210. 155. 120 
222, 108 



910 
n.a. 
740 

410, 540 

315, 145, 110,80 

n.a. 

240, 210, 155. 120 
222, 200, 108, 70 



180 


180 


170 


170 


168 


168 


165. 146 


165, 146 


38 


38 


35 


35 


90. 40, 22 


90, 40. 22 


25. 7.2. 6.2 


25 


25 


n.a. 


24, 22* 


24, 22* 


23 


23, 19 


21 


21 


9.4, 3.8, 2.9, 2.7, 2.4. 


9.4,7.1,4.2,3.3,2.9. 


1.5. 1-1 


2.7. 1.9, 1-5, l.l 


14, 10 


10 


15. 12.8,6 


n.a. 


16 


16,9 


7.5,6 


7.5.6 


6.9,6.2.5.6. 5.2. 5,2.7. 


6.2. 5.6, 5.2, 4.3, 2.9, 


1.9, 1.8, 1.7, 1.2, 0.6 


1.7, 1.2 



15 

11.4, 3,2. 1.9, 1.7, 1.5 
6.9, 3.6.2.8. 1.6, 1.2 



15 

10.2,7.6.3,2, 19. 1.7. 
1.5 

3.6, 2.8, 2.5. 1.6. 1.2 
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Mspl 



3 9 3.0.2.8.2.5.2,1.6. 3.9.3.6.2.8.2.5.2.2. 3.6.3.2.2.8.2.5.2.2. 

1.6.1.5.1.3.1-2.0.9 1.6,1.5,1-2.1 
10 



1.2 

n.a 

n.a. lv/ 

_ „ n.a 

XhoII 7.5 na - 

7.5 75 



Sspl 
Xho 

BgUl 8.5. 6, 5. 4.7. 3,. 2.5 Z. 4.7. 2.5. ..6. ..5. , 7. 6, 5. 4.7. 2.5, I A 

AvaH 7.4,3.7.3.4.2.8.2.6. 3.7,2.8,2.6,1.8.1.7. 4.3'. 3.7', 2.8. 2.6, 1.8. 

1.8.1.7.1.4.1.2.1.1 1.4,1.2.1.1,0.9.0.8. 1.7.1.4.1.2 

0.5 

S'»I 12.5, 8, 7.5 1«. 9. 8.5 ^7 4 1 3 1 25 23 

HindlH 6.6.5.4,4.7.4.4,2.9. 5.4.7,4.4.4.1.2.9.2.5. 5.4.7,4.1.3.1.2.5,2.3. 

2.5 0-7 W 

. „ data not avai.abie.The values represent restriction fragment lengths in Wlobases. Multiple 
values for an enzyme denote different bands detected by a cosmid probe on a gel lane. Stnce 
there were no detectable differences between the DNA of patient BE and those of his parents 
i. any of the fragments (except for a BamHI polymorphic band found in one of the parents, 
indicated by an asterisk), only one set of values is shown for all three genomic DNA. 
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TABLE2 



Table 2 Vectors for cloning centromeric regions from normal chromosome 10 or marde. w 
DNA into yeast artificial chromosomes (YACs). These YACs can be shuttled into rnammauan 

cells to test for function as HACs. 



Vector: Key Feature(s) Map 



pJS97ARTi hTEL/I-Scel/yTEL, DHFR RgAA 

pJS98ANTi hTEL/I-Scel/yTEL, neo Fig-8B 

Fragmentation 1 hTEL/I-Scel/yTEL, hyg *&* C 

Fragmentation 2 (-/+ hGH) hTEL/l-Scel/yTEL, neo. hGH F lg .8D 

pVC39-AAH2 ALU-ALU TAR vector ^S- 8E 

pTEL/CAT/TEL hTEL/I-Scel/hTEL/neo Rg^F 

pAAH/TCNa TAR vector with hTEL/I-Scel/hTEL/neo Fig.8G 

pVC39-ALU/C3-F2(+/-) ALU-specifc TAR vectors Fig.8H 

pXCS ends of PAC4 in pBS Fi S- 81 

dTCN-TCS specific TAR vector hTEL/I-Scel/hTEL/neo Fig.SJ 



TABLE 3 

PCR analysis of YAC Sf-52-E8 done and comparison with the HC-contig/neo- 
centromere region from normal chromosome 10 and mar del (10) 



Primer-Pairs 1 



BE2CM8-lf 



r.Pnnmic DNA "corf in PCB (product si ze in kb) 

* BE2C1-18-Sf b YAC5f-52-E8 



norm: 141+55 


1 RO 


1.80 


not present 


norm: 32 + 30 




0.90 


0.90 


norm: 28 + 29 


1 00 


1.00 


1.00 


norm: 1 + 3 


7 90 


2.90 


2.90 


norm: 39 + 52 


X.iSJ 


1.20 


1.20 


norm: 5 + 7 


023 


0.23 


0.23 


norm; 16 + 5 


3.50 


3.50 


3.50 


norm: 9+ 14 


0.90 


0.90 


0.90 


norm: 36 + 37 


2.00 


2.00 


2 00 


norm: 168 + 7 1 


4.00 


4,00 


4.00 


norm: 27+10 


15.90 


15.90 


15.90 


norm: 18+ 17(VNTR) C 


1.20 


1.40 


1.40 


norm: 68+17 


8.00 


8.00 


8 00 


norm: 34 + 47 


3.00 


3.00 


3.00 


PAC4t7: a + b 


030 


0.30 


not present 


AFM259xg5: ca + gt c 


0.21 


0.19 


not present 



# • 



'Refer to Fig. la for the relative positions of each primer-pair. 

b BE2Cl-18-lf and BE2Cl-18-5f are somatic hybrid cell lines containing the normal human 
chromosome 10 and mar del (10), respectively (2). 

'The 'norm: 18 + IT and 'AFM259xgS: ca and gf primer sets allow distinction between the 
normal human chromosome 10 and mar del (10) by detecting a VNTR and a microsatellite, 

respectively. 
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